Access Global AI Models - Power Next-Gen Apps

From General to Specialized AI - All Models in One Platform

LLM Tools：

Model Comparison Cost Calculator Arena Open Source Models

Release Date

Input Price

Output Price

Filter

Service Provider

Classification

Capabilities

Context Length

85 models match the criteria

Release Date

Input Price

Output Price

Gemini 2.0 Flash

Text GenerationMultilingualTool Call

Gemini 2.0 Flash is a multimodal AI model launched by Google and is a sub - model of the Gemini 2.0 suite. It has the capabilities of text understanding, image generation and editing, supports a context window of 1 million tokens, and its response speed is twice as fast as that of 1.5 Pro. It is suitable for scenarios such as advertising design, social media content creation, and educational illustration generation. Developers can access it through Google AI Studio and the Gemini API.

Claude Haiku 4.5

Text generationMultilingualTool Call

Claude Haiku 4.5 is a small hybrid inference AI language model launched by Anthropic. Its performance is close to that of the medium-sized model Sonnet 4, and its cost is only one-third of it, with the inference speed more than doubled. It has the ability to process a context of 200,000 tokens, supports multimodal prompts, and has an AI security level of ASL-2. It is suitable for real-time response scenarios such as intelligent customer service, programming assistance, and conversational assistants, and can be integrated through the Claude application, API, and major cloud platforms.

Gemini 2.5 Flash

Text GenerationMultilingualTool Call

Gemini 2.5 Flash is a lightweight multimodal AI model launched by Google. It supports text, image, audio, and video inputs, has adaptive inference capabilities, and improves token usage efficiency by 20 - 30%. It is suitable for high-throughput, low-latency tasks such as translation, classification, and multimodal interaction, and is open to developers and enterprise users.

Claude Sonnet 4.5

Text Generation

Claude Sonnet 4.5 is a mid - range balanced AI model released by Anthropic in September 2025. It belongs to the "medium - sized" product in the Claude series, positioning as a compromise between performance and cost. It has outstanding programming capabilities, with a score of 77.2% in the SWE - bench Verified test, supports continuous programming for over 30 hours, and can build production - level applications. It also has both efficient reasoning and visual processing capabilities, with a fast response speed and moderate cost, suitable for software development, complex intelligent agent construction, and enterprise - level tasks.

Gemini 2.5 Flash Lite

Text generationMultilingualTool Call

Gemini 2.5 Flash - Lite is a lightweight AI inference model (preview version) launched by Google, featuring ultra - fast response and cost optimization. It is the fastest Gemini model currently. It supports multimodal input, a 1 - million - token context, and Google's native tools (such as search and code execution). It is suitable for high - throughput, low - latency scenarios (such as translation and classification) and provides API services for developers.

Qwen3 Omni Flash Realtime

Full modalityMultilingual

Qwen3-omni-flash-realtime is a real-time full-modal AI model launched by Tongyi Qianwen of Alibaba. It supports multimodal processing of text, images, audio, and video, and has real-time interaction capabilities such as streaming conversations and mid-way interruption. It can be applied to scenarios such as voice assistants, multimedia analysis, and intelligent editing, and supports 119 text languages and 20 voice interactions.

Qwen3 Tts Flash Realtime

Speech synthesisMultilingual

Qwen3-TTS-Flash-Realtime is a real-time text-to-speech model launched by Tongyi of Alibaba. The first packet delay is 97ms. It supports 17 timbres, 10 languages, and 17 dialects. The speech is natural and fluent. It is suitable for scenarios such as intelligent customer service, audiobooks, AI teachers, and film and television dubbing.

Doubao 1.5 Pro 32k

Text generationTool Call

Doubao-1.5-pro-32k is a product in the ByteDance Doubao large model series, which is a large AI model. It uses a sparse MoE architecture, featuring low hardware costs, high inference efficiency, and strong multimodal capabilities. It supports visual understanding and real-time voice interaction. In evaluations of knowledge, code, reasoning, etc., it outperforms models such as GPT-4o. It is suitable for scenarios such as copywriting generation and intelligent interaction.

Doubao Seedance 1.0 Pro

Video generation

Doubao-Seedance-1.0-pro is a large video generation model launched by ByteDance. It supports text-to-video and image-to-video generation, with an output resolution of 1080P and a maximum duration of 10 seconds. Features: Seamless switching between multiple shots, natural dynamic effects, stable subject movement, fast generation speed (about 40 seconds for a 5-second video), and low cost (about 3.6 yuan for a 5-second 1080P video). It is applied in scenarios such as creative content production, marketing videos, and social entertainment, and is available through platforms such as Jimeng AI and Volcengine.

Hunyuan T1 Latest

Inference modelChinese, English

Hunyuan-T1-latest is a large deep inference model launched by Tencent in March 2025. It adopts the Hybrid-Transformer-Mamba MoE architecture and has a parameter scale in the trillions. It has super-strong long-text capture, mathematical/logical reasoning, and code generation capabilities. The decoding speed is 60 - 80 tokens/s. It supports API calls and is suitable for scenarios such as complex problem solving, scientific computing, and AI search.

Inference modelTool Call

DeepSeek-V3.1 is a large language model released by the Chinese AI company DeepSeek in August 2025. It adopts a hybrid inference architecture and a 671 billion parameter MoE design, supporting the switch between "thinking" and "non-thinking" dual modes, and unifying general dialogue, complex reasoning, and coding capabilities. Its agent capabilities are enhanced and can be used for tool usage, multi-step reasoning, and programming assistance. The API has been opened, and an MIT open-source license is provided, making it suitable for scenarios such as agent development and financial risk control.

Tencent Hunyuan Video Generation

Video GenerationChinese, English

Tencent Hunyuan Video Generation is an AI video generation and processing technology service launched by Tencent. Based on multi-modal fusion technology, it supports functions such as video special effects, stylized conversion, and image dynamicization. Features include high-coherence motion generation and accurate semantic understanding. It is suitable for scenarios such as short video creation, advertising and marketing, and educational content production, which can lower the threshold for professional production and improve content production efficiency.

Text generation

GPT-5 Mini is a lightweight language model launched by OpenAI, focusing on high efficiency and low cost. It is suitable for structured tasks such as form filling, data extraction, and standardized content generation. It reduces the demand for computing resources, enabling small and medium-sized enterprises to afford high-quality AI services. It balances performance and cost and is a reliable choice for teams with limited budgets.

Claude Opus 4.1

Text GenerationMultilingualTool Call

Claude Opus 4.1 is a top - tier large language model developed by Anthropic and is the core engine of the Claude family. It features excellent long - text processing capabilities (with a context of over 200,000 characters) and strong complex reasoning abilities. It scored 74.5% on the SWE - bench coding test, supports multimodal input, and uses Constitutional AI technology to ensure safety. It is suitable for professional scenarios such as enterprise - level document analysis, code refactoring, and academic research.

Text generationChinese, EnglishTool Call

GLM-4.5-Flash is an open-source foundation model released by Zhipu AI. It uses a Mixture of Experts (MoE) architecture, provides dual inference modes of thinking/non-thinking, and supports tool invocation and multi-framework compatibility. It is suitable for agent development, code generation, and complex reasoning. The MIT license allows commercial use, offering high cost-effectiveness.

Text generationChinese, EnglishTool Call

GLM-4.5-AirX is a lightweight hybrid inference large model launched by Zhipu AI, with a total of 106 billion parameters (12 billion active parameters). It adopts the MoE architecture, natively integrates inference, encoding, and agent capabilities, and supports both thinking (complex reasoning/tool use) and non-thinking (instant response) modes. It is suitable for agent development, local deployment, and multilingual processing. It is open-source and commercially available under the MIT license.

Text generationChinese, EnglishTool Call

GLM-4.5-Air is a lightweight large language model with hundreds of billions of parameters launched by Zhipu AI of Tsinghua University. It adopts the MoE architecture (106 billion total parameters / 12 billion active parameters), focusing on hybrid inference capabilities and supporting the switching between the thinking mode for complex tasks and the non-thinking mode for quick responses. The model files of its quantized versions (such as 4-bit AWQ) are only 64GB, suitable for local deployment and edge devices, balancing performance and efficiency. It is suitable for intelligent agent development, tool invocation, and scenarios with limited resources. It has been open-sourced and its API interface is available.

Text GenerationChinese, English

Spark X1 is a deep inference large model released by iFlytek in January 2025. It is trained on a fully domestic computing power platform. Its core features include: supporting mathematical reasoning across all school levels (covering primary, junior high, high school to the AIME competition), optimizing the hallucination problem with multi - path sampling verification technology, and having multi - language processing capabilities. Its application scenarios have been implemented in fields such as education (AI learning machines, teacher assistants) and healthcare (auxiliary diagnosis systems). It completed an iterative upgrade in July 2025 to further enhance reasoning accuracy and industry adaptability.

Hunyuan TurboS Latest

Text generationChinese, English

Hunyuan-TurboS-latest is a new-generation fast-thinking large model launched by Tencent Hunyuan. It is an ultra-large Hybrid-Transformer-Mamba MoE model. It features a fast response speed, with the word output speed doubled and the latency of the first word reduced by 44%. It performs outstandingly in knowledge, mathematics, and creation, etc. On multiple publicly available benchmarks commonly used in the industry, it demonstrates results comparable to those of industry-leading models such as DeepSeek V3, GPT 4o, and Claude 3.5 in multiple fields including knowledge, mathematics, and reasoning. It is suitable for real-time AI application scenarios such as 3D modeling, video special effects production, and voice interaction.

Text generationMultilingualTool Call

DeepSeek-V3 is an AI model released in March 2025, with 685 billion parameters, belonging to the large language model. Features: Strong inference and programming capabilities, such as solving AIME competition questions and generating efficient code; Low cost, supporting operation on consumer-grade devices; Open source under the MIT license. Suitable for scenarios such as web development and mathematical reasoning.

Text GenerationMultilingual

Qwen3-0.6B is a lightweight causal language model released by Alibaba DAMO Academy. It has 0.6B parameters, a 28-layer network structure, and the GQA grouped query attention mechanism, supporting a super-long context of 32k. It features intelligent switching between dual modes, low resource consumption, and can be deployed on consumer-grade devices. It is suitable for scenarios such as AI assistants on edge devices, lightweight intelligent customer service, and offline browser conversations.

Spark Medical Large Model Lite

Text generationChinese

iFlytek Spark Medical Large Model - Lite is a lightweight medical AI model with capabilities such as medical knowledge Q&A, professional document generation, and diagnosis recommendation. It supports multi - round interaction and is suitable for assisting doctors in diagnosis and treatment, optimizing hospital processes, and patient health management.

Doubao 1.5 Thinking Vision Pro

Visual understandingTool Call

Doubao-1.5-thinking-vision-pro is a multimodal AI model with visual understanding and deep thinking capabilities. It supports text - image and voice interactions and is suitable for professional field reasoning and creative tasks.

Text generationMultilingualTool Call

Gemma 3n E2B is a multimodal edge AI model launched by Google. It supports local processing of text, images, audio, and video. Its performance is comparable to that of a model with 5 billion parameters. The low-memory and low-power consumption design is suitable for smartphones and wearable devices, enabling privacy protection and real-time interaction.

Gemma 3n E2B Instructed

Text generationMultilingualTool Call

Gemma 3n E2B Instructed is a lightweight multi-modal model developed by Google DeepMind. Based on the MatFormer architecture, it can run with only 2GB of memory. It supports text, image, audio, and video processing and is suitable for local deployment on edge devices such as mobile phones and Raspberry Pi. It can be used in scenarios such as chatbots, content generation, and multi-modal data extraction.

Pangu NLP N2 Reasoner 128K 5.0.0.1

Inference modelChinese, EnglishTool Call

Pangu-NLP-N2-Reasoner-128K-5.0.0.1 is a large NLP model in Huawei's Pangu series. It is an AI model based on logical reasoning, supporting a context length of 128K. It can access text datasets such as pre - trained text and multi - round Q&A, and is suitable for scenarios such as intelligent customer service, text parsing, and industry knowledge retrieval. It provides API interfaces to support multi - language development.

Claude Sonnet 4

Text generationMultilingual

Claude Sonnet 4 is a general-purpose large language model in the Claude 4 series launched by Anthropic in May 2025. It is positioned as a high-performance balanced model, supporting text/image input and a 200K context window, and uses a dynamic hybrid inference mechanism to balance efficiency and cost. Its features include outstanding coding efficiency (72.7% on SWE-bench), fast response speed, and support for multi-tool invocation. It is suitable for scenarios such as programming assistance, AI Agent development, daily development, content generation, and data analysis. Basic functions are open to free users.

Text GenerationChinese, English

Tencent Hunyuan Turbo is a new-generation large language model released by Tencent in September 2024. It adopts the MoE (Mixture of Experts) architecture, with a total parameter count reaching the trillion level. Its features include a 100% improvement in inference efficiency and a 50% reduction in cost. It benchmarks against GPT-4o in tasks such as mathematical reasoning and text creation, and supports AI search internet plugins and the SearchGPT function. It is mainly applied in nearly 700 internal business scenarios at Tencent, such as Tencent Cloud, QQ, and WeChat Reading, and is open to enterprises through the Tencent Cloud API.

Pangu MM M2 AIGVideo 1.0.0

Video generationChinese, English

Huawei Pangu Image-to-Video Model Pangu-MM-M2-AIGVideo-1.0.0 supports multi-ratio 5-second video generation and 96-frame continuation, deployed with 8 inference units

Image generationChinese, English

CogView-4 is an open-source text-to-image model released by Zhipu in 2025. It supports bilingual input in Chinese and English and can generate images at any resolution. It uses the GLM-4 encoder to enhance Chinese semantic understanding. It achieved the first comprehensive score in the DPG-Bench test, reaching the open-source SOTA level. It follows the Apache 2.0 license and is suitable for scenarios such as advertising design and educational illustrations.

AIBase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2026AIBase